Post-processing based on Utterance Verification in Online Keyword Recognition for Multimedia Content Retrieval
نویسندگان
چکیده
In this paper, we propose an utterance verification-based postprocessing in online keyword recognition for multimedia content retrieval. The proposed post-processing technique verifies whether a candidate keyword segment can be categorized as a keyword. For this work, we employ a confidence measure based on the recognition results. In keyword recognition experiments, our approach achieved better performance than the conventional approach.
منابع مشابه
Confidence Measure for Utterance Verification in Keyword Spotting System
In this article, we propose an utterance verification technique for keyword spotting. The keyword spotting system analyzes a given spoken content and searches every speech segment in which one of pre-defined keywords is uttered. To maintain a stable recognition performance in the system, we propose an utterance verification technique that verifies whether a found utterance, or a candidate keywo...
متن کاملDiscriminative Utterance Verification For Connected Digits Recognition - Speech and Audio Processing, IEEE Transactions on
Utterance verification represents an important technology in the design of user-friendly speech recognition systems. It involves the recognition of keyword strings and the rejection of nonkeyword strings. This paper describes a hidden Markov model-based (HMM-based) utterance verification system using the framework of statistical hypothesis testing. The two major issues on how to design keyword ...
متن کاملUtterance Verification Using Prosodic Information for Mandarin Telephone Speech Keyword Spotting - Acoustics, Speech, and Signal Processing, 1999. Proceedings., 1999 IEEE International Conference o
In this paper, the prosodic information, a very special and important feature in Mandarin speech, is used for Mandarin telephone speech utterance verification. A two-stage strategy, with recognition followed by verification, is adopted. For keyword recognition, 59 context-independent subsyllables, i.e., 22 m s and 37 FINAL’S in Mandarin speech, and one backgroundkilence model, are used as the b...
متن کاملImproving Task Independent Utterance Verification Based on On-line Garbage Phoneme Likelihood
Utterance verification based on on-line garbage (OLG) models is often adopted as the benchmark method. However, we find its performance can be remarkably improved by fine-tuning. In this study, OLG phoneme likelihood is proposed. It achieves much better performance and efficiency for task independent utterance verification to reject mis-recognition and OOV utterances than the OLG frame likeliho...
متن کاملIntegration of phonetic and prosodic information for robust utterance verification - Vision, Image and Signal Processing, IEE Proceedings-
Mandarin speech is known for its tonal charactcristic, and prosodic information plays an important role in Mandarin speech recognition. Driven by this propcrty, phonetic and prosodic information are integrated and used for Mandarin telephone speech keyword spotting. A two-stage strategy, with recognition followed by verification, is adopted. For keyword recognition, 132 subsyllable models, two ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012